Acquiring Item Difficulty Estimates: a Collaborative Effort of Data and Judgment. Nominee for Best Paper Award
نویسندگان
چکیده
The evolution from static to dynamic electronic learning environments has stimulated the research on adaptive item sequencing. A prerequisite for adaptive item sequencing, in which the difficulty of the item is constantly matched to the knowledge level of the learner is to have items with a known difficulty level. The difficulty level can be estimated by means of the item response theory (IRT), as often done prior to computerized adaptive testing. However, the requirement of this calibration method is not easily met in many practical learning situations, for instance, due to the cost of prior calibration and due to continuous generation of new learning items. The aim of this paper is to search for alternative estimation methods and to review the accuracy of these methods as compared to IRT-based calibration. Using real data, six estimation methods are compared with IRT-based calibration: proportion correct, learner feedback, expert rating, paired comparison (learner), paired comparison (expert) and the Elo rating system. Results indicate that proportion correct has the strongest relation with IRT-based difficulty estimates, followed by learner feedback, the Elo rating system, expert rating and finally paired comparison.
منابع مشابه
A New Similarity Measure Based on Item Proximity and Closeness for Collaborative Filtering Recommendation
Recommender systems utilize information retrieval and machine learning techniques for filtering information and can predict whether a user would like an unseen item. User similarity measurement plays an important role in collaborative filtering based recommender systems. In order to improve accuracy of traditional user based collaborative filtering techniques under new user cold-start problem a...
متن کاملنتایج خود ارزیابی براساس مدل جایزه ی ملی کیفیت ایران دربیمارستان مرکزی صنعت نفت؛ 1385 (INQA)
Introduction: The healthcare organization directors should utilize the quality management tools properly to improvement healthcare services. The is a crucial tool to measure achievement to organization quality goals ,and to improvement performances. The Iran National Quality Award is the most comprehensive model for performance assessment. Methods: This is a descriptive case assessment, applied...
متن کاملA Model-Driven Decision Support System for Software Cost Estimation (Case Study: Projects in NASA60 Dataset)
Estimating the costs of software development is one of the most important activities in software project management. Inaccuracies in such estimates may cause irreparable loss. A low estimate of the cost of projects will result in failure on delivery on time and indicates the inefficiency of the software development team. On the other hand, high estimates of resources and costs for a project wil...
متن کاملInvestigating the Impact of Response Format on the Performance of Grammar Tests: Selected and Constructed
When constructing a test, an initial decision is choosing an appropriate item response format which can be classified as selected or constructed. In large-scale tests where time and finance are of concern, the use of response chosen known as multiple-choice items is quite widespread. This study aimed at investigating the impact of response format on the performance of structure tests. Concurren...
متن کاملExamining the Features of an English Language Test: Reliability-related Issues
Most universities across Iran tend to develop English tests for placement, exit, achievement, and other purposes. Examining the various features of such tests is imperative for making informed decisions about learners’ achievement level. The present study examined the features of a university-wide administered English language achievement test at Iran University of Science and Technology (IUST)...
متن کامل